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(57) The present invention provides a data proces- 
sing system, and method of operating such a 
system, for facilitating a connection of a prog- 
ram on a client computer to a server, the server 
consisting of a plurality of server computers 
with shared resources. The data processing 
system, the client computer, and the server 
computers all reside in a logical network. The 
data processing system has an input means for 
receiving a request from the client computer for 
a machine address of a server computer iden- 
tified by a server computer name sent with the 
request, such a machine address enabling a 
connection to be made from the client conv 
puter to that server computer via the networtc, A 
storage device is provided by the system for 
storing a list identifying server computer names 
with machine addresses of the server compu- 
ters. A conversion means in the system uses the 
list to convert the server computer name re- 
ceived by the input means into the machine 
address of the server computer, and then an 
output means sends the machine address from 
the conversion means to the dient computer. 
The syst m is characterised by decisi n logic 
for studying th s rv r computers at predeter- 
min d int rvals having regard to a predeter- 
mined test criteria, in order to select one of the 
server computers ; and writing m ans for up- 
dating the list by associating the machin ad- 
dress for the s rver computer sel cted by th 
decision logic with a particular s rver computer 
name contain d as a g neric s rver comput r 



name in the list Using this technique, when a 
dient computer specifies the generic server 
computer name, it receives the machine 
address of the server computer identified by the 
decision logic. 
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The present invention relat s to a data process- 
ing system for facilitating the conn ction of a program 
on a cli nt computer to a server, the server consisting 
of a pluralityof server computers with shared resourc- 
es. ' 

The data processing system, the dient computer, 
and the server computers are all resident on a net- 
work. This network need not be ono physical network 
such as a Local Area Network (LAN); for instance it 
may consist of a number of LANS or WANS (wide area io 
networks) connected together (eg. via 'bridges') to 
form a single logical network. However the same net- 
work protocol will be employed throughout the net- 
work, a typical example of such a protocol being 
TCP/IP which wilt be familiar to those skilled in the art. is 

In many environments it is increasingly found that 
a number of server computers are connected togeth- 
er using some form of network, eg. a LAN. It is often 
the case that several users of client computers will be 
connected to one such server computer, whilst other 20 
server computers stand idle. An example of this is 
when such computers are situated in workers' offic- 
es, connected by. for example, a token ring LAN using 
the standard TCP/IP network protocol. When workers 
are away from their offices, their computers will usu- 25 
ally stand Idle. 

In such situations it is commonly the case that a 
few of the computers in the network are heavily load- 
ed, whilst other computers in the network are very 
lightly loaded, giving poor response and performance 30 
for the client computers using the heavily loaded ser- 
ver computers. Hence there is a problem of how to en- 
able the client user load to be spread more evenly 
across the available computing resources of the ser- 
ver in a manner which is transparent to the client com- 35 
puter and its programs. If transparency is to be ach- 
ieved, standard protocols need to be observed in or- 
der that client computers can use a variety of connec- 
tion methods without modification of any programs 
being required. 

A prior art technique which has been developed 
to provide some sort of load spreading is called "Static 
load levelling". With this technique each application 
on each client computer has a designated server to 
which it always connects. Hence, for example, if there 45 
are 200 potential clients of a server having five server 
computers, a pre-specif ied group of, say, 40 of the cli- 
ents will be told (or configured) to always connect to 
machine 1 , etc. On average it may be argued that this 
will give a reasonably even load across all of the ser- 50 
ver computers. However in practice it is often the 
case that, using this technique, a larg numb r of 
users of client computers are connected to one s rver 
computer, while an adjacent s rver computer stands 
completely idle ( g. giv n the above example th re 55 
could easily be 40 us rs on one s rver computer 
whilst th other 4 s rver comput rs stand idle). 
Hence in situations wh re th user loading chang s 



from time to time, th prior art static load levelling 
technique is not particularly satisfactory. What is r - 
quired in such instances is a more 'dynamic' techni- 
que which can r spond t changing user loads, and 
thus dir ct n w users to the most suitable (eg. least 
heavily loaded) server computers in the server. 

Other prior art techniques can be found in other 
environments, such as those where job allocation is 
an issue. For instance in "batch processing", a client 
computer submits an encapsulated task to a central 
server, which determines which one of several possi- 
ble servers is quiet enough to be able to handl the 
task. The task Is sent to that server, is processed, and 
the results are then sent back to the dient (e.g. as a 
results file, or by electronic mail). With a batch pr c- 
essing system, there Is a brief connection to the cen- 
tral server while the job Is transferred from the dient 
to the server allocated by the central server. After this 
brief connection the dient disconnects, and has no 
more interaction with the submitted task until it has 
been completed, and the results have been passed 
back to the dient by some means. 

However in the situation with which we are cur- 
rently concerned, the dynamic load levelling techni- 
que that is required must be able to deal with "inter- 
active" sessions. Rather like a phone conversation, 
the connection between the program on the di nt 
computer and a particular server computer will p rsist 
for the duration of the "conversation" session. H nee 
the batch processing concept is inappropriate in th 
present situation. 

It is possible to write some specific code within a 
program on a dient computer which contains internal 
message-passing systems to route work from that di- 
ent program to a corresponding server program. Such 
systems are dedicated only to that particular di nt 
program, and the connection and load-levelling meth- 
ods are not accessible to other dient-server applica- 
tions. Often, such systems operate by the client con- 
necting to a specific "host" server computer, andfr m 
there the work will be sent to another server for proc- 
essing. Clearly this technique can result in large bot- 
tlenecks arising at the "host" server computer. 

Hence such a technique is not suitable in th 
present situation since it only supports one very sp - 
cific type of dient-server connection, whereas we 
need a technique that will allow any dient-server con- 
nection method using the network protocol to be con- 
nected to a quiet server in a way that is completely 
transparent to the dient program. Further the abov 
technique relies on an initial connection to the 'cen- 
tral' host server computer, which then passes the r - 
quest on to another server computer as described 
abov this can potentially create a serious bottle- 
n ck. 

It is an object of th pr sent invent! n to provide 
a techniqu which facilitates a connection betw en a 
cii nt program and a server comput r on a s rver in 



3 



EP 0 648 038 A2 



4 



a way that takes into account the current status of the 
server computers forming the server This technique 
must observe standard network protocols and should 
op rate in a manner which is transparent to the cli nt 
program requesting access. 5 

Accordingly the present invention provides a data 
processing system for facilitating a connection of a 
program on a client computer to a server* the server 
consisting of a plurality of server computers with 
shared resources, the data processing system, the io 
client computer, and the server computers residing in 
a network, the system comprising: input means for re- 
ceiving a request from the client computer for a ma- 
chine address of a server computer identified by a 
server computer name sent with the request, such a f5 
machine address enabling a connection to be made 
from the dient computer to that server computer via 
the network; a storage device for storing a list identi- 
fying server computer names with machine address- 
es of the server computers; conversion means for us- 20 
ing the list to convert the server computer name re- 
ceived by the input means into the machine address 
of the server computer; output means for sending the 
machine address from the conversion means to the 
client computer the system being characterised by: 25 
decision logic for studying the server computers at 
predetermined intervals having regard to predeter- 
mined test criteria, in order to select one of the server 
computers; and writing means for updating the list by 
associating the machine address for the server com- 30 
puter selected by the decision logic with a particular 
server computer name contained as a generic server 
computer name in the list; whereby when a client com- 
puter specifies the generic server computer name, it 
receives the machine address of the server computer 35 
identified by the decision logic. 

Typically the conversion means will access the 
list from a local piece of storage, the data processing 
system having a copier to copy the list from the stor- 
age device to that piece of memory. In preferred em- 40 
bodiments the data processing system further com- 
prises a messaging means, responsive to the updat- 
ing of the list by the writing means, for sending a mes- 
sage to the copier requesting the copier to copy the 
updated list into the piece of local memory. 45 

Any manner of predetermined test criteria can be 
used in the data processing system of the invention, 
for example the amount of idle processor time, the 
number of processes running, the amount of free 
memory, the load average", etc. However in prefer- so 
red embodiments the predetermined test criteria are 
such that the decision logic id ntifi sth servercom- 
puter having the I ast numb r of cli nt programs log- 
ged on to it. 

In preferred embodim nts the pr determin d in- 55 
tervals are variable and will eith r be set by a user of 
the system, eg. th system administrator, or will be 
adjusted dynamically. The us r will also set the pre- 



determined test criteria to be used by the d cision log- 
ic. 

Further in preferred mbodlments th user can 
limit th number of server computers which the deci- 
sion logic studies. This may be us ful if, forinstanc . 
some of the server computers have not got access to 
all of the resources that other server computers hav 
access to, and so would not be suitable as s rver 
computers to be associated with the generic serv r 
computer name. 

In some embodiments it may be advantageous to 
use a plurality of generic names. Each server name 
would then have a number of server computers 
whose machine addresses are associated with that 
generic name, the decision logic employing different 
sets of predetermined test criteria for each generic 
name. In such embodiments one or more of the s r- 
ver computers can be associated with a plurality of 
the generic names. 

Viewed from a second aspect the present inv n- 
tion provides a method of operating a data processing 
system to facilitate a connection of a program on a cli- 
ent computer to a server, the server consisting of a 
plurality of server computers with shared resources, 
the data processing system, the client computer, and 
the server computers residing in a network, the m tri- 
ed comprising the steps of: (a) receiving a request 
from the client computer for a machine address of a 
server computer identified by a server computer 
name sent with the request, such a machine address 
enabling a connection to be made from the cli nt 
computer to that server computer via the network; (b) 
storing a list identifying server computer names with 
machine addresses of the server computers in a stor- 
age device; (c) converting, with reference to the list, 
the server computer name received at step (a) into 
the machine address of the server computer; (d) 
sending the machine address Identified at step (c) to 
the client computer the method being characteris d 
by the steps of: (e) employing decision logic to study 
the server computers at predetermined intervals hav- 
ing regard to predetermined test criteria, in order to 
select one of the server computers; and (f) updating 
the list by associating the machine address for th 
server computer selected by the decision logic with a 
particular server computer name contained as a g n- 
eric server computer name in the list; whereby wh n 
a client computer specifies the generic server com- 
puter name at step (a), it receives the machine ad- 
dress of the server computer identified by the deci- 
sion logic. 

The pr sent invention will be d scrib d further, 
byway fexampi nly, with ref re nee to an em bod i- 
m nt th rof as illustrat d in th accompanying draw- 
ings, in which: 

Figure 1 is a block diagram illustrating th data 

pr c ssingsyst mofthepref rred mbodiment; 

Figur 2 is a flow diagram illustrating h wthed - 
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cision logic in the data processing system of the 
preferred mbodiment operates; and 
Figure 3 illustrat s a particular embodiment 
where two generic computer names ar used. 
In the preferred embodim ntw willconsid rthe 5 
situation wh re th server in question Is a high p r- 
formanc database serv r which has its data distrib- 
uted across a network of server computers, this ser- 
ver network hereafter being referred to as a cluster. 
Database applications being run by users on client io 
computers are required to connect to one of the ser- 
ver computers in the cluster to enable them to access 
the data in the database server. By the nature of the 
database system, it does not matter which server 
computer the client connects to - all of the data is ac- is 
cessible from any server computer in the cluster. In 
the preferred embodiment the server computers and 
the client computers are all interconnected using 
TCP/IP on a token ring Local Area Network. 

For a large number of users, it is highly desirable 20 
to have a number of users on each of the server com- 
puters in the server cluster, rather than all users con- 
necting to (and hence overloading) just one or a few 
of the server computers. With a widely varying user 
workload profile for the database server this problem 25 
can only be solved by providing some form of "load 
levelling" process, which will allocate new client appli- 
cation instances to server computers in the cluster 
that are most suitable for the client connection (eg. 
because they are more lightly loaded than other ser- so 
ver computers). Clearly this process must be dynam- 
ic, able to respond to changing load conditions over 
time. Since the database applications on the client 
computers are typically complex and often are sup- 
plied only in object code form, it would be very difficult 35 
(or impossible) for the system administrator to alter 
them, and so it is essential that this allocation is done 
in a manner which is entirely transparent to the client 
application. 

The manner in which the data processing system 40 
of. the preferred embodiment solves the above prob- 
lems will now be described with reference to Figure 1 . 

Each client computer in a network using the 
TCP/IP protocol (there will typically be many such cli- 
ent computers) will have been informed by the net- 45 
work administrator that it is to communicate with a 
particular computer when it wishes to convert a conv 
puter name of another computer in the network into a 
machine address. When utilising the present inven- 
tion that computer will be the data processing system so 
of the preferred embodiment. 

Hence when a program running on a client com- 
puter 20 (for clarity, only n client computer is illu- 
strat d) wishes to obtain acc sstoas rv rcomputer 
(40. 50, 60) in the dust r it will c mmunicate with the 55 
data processing syst m 10 in rder to btain a full In- 
tern t machine address for the d sired serv r (Inter- 
net addressing is part of the TCP/IP protocol). With 
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the prior art technique the client computer would 
specify a server computer nam in this c mmunica- 
tion that was specific to one particular server comput- 
er in the duster. Th input means 30 of the data proc- 
essing system 10 would receive this s rver computer 
name and pass it to the conversk>n m ans 70. 

In a storage device 80 of the data processing sys- 
tem a list is maintained which identifies server com- 
puter names with particular Internet addresses. 
When the conversion means is initiated the copier 90 
copies this list from the storage device 80 into a piece 
of local memory 100 accessible by the conversion 
means 70. Hence the conversion means will access 
the list in memory 100 to find the Internet address of 
the computer associated with the server computer 
name passed to it by the input means 30. This Internet 
address will then be provided by the conversion 
means to the output means 11 0 for transmissbn back 
to the client computer 20. 

Once the dient computer has the Internet ad- 
dress it can then make direct contact with the server 
computer residing at the Internet address provided; in 
Figure 1 this is server computer 2. Since the TCP/IP 
protocol is used any of the access methods that use 
this protocol can be used to access the server com- 
puter. 

When using the data processing system of th 
preferred embodiment the program running on the di- 
ent computer 20 will not use the server computer 
name that it previously used. Instead a generic serv r 
computer name will be used. This generic serv r 
name will either have been placed in the program's 
configuration file, or alternatively the user of the pro- 
gram will specify the generic name when running the 
program. 

Within the data processing system, decision logic 
120 is provided which periodically studies the server 
computers in the duster having regard to some pre- 
determined test criteria, hereafter called the metric 
string. In the preferred embodiment the metric string 
is a list of questions which when answered by the va- 
rious server computers will enable the decision logic 
to decide which server computer is most suitable for 
a client connection (the most suitable perhaps being 
the least heavily loaded server computer). The metric 
string can be altered as the system administrator 
deems appropriate* depending on what criteria the 
administrator wishes to be used to select a server 
computer. 

In the preferred embodiment the decision logic 
actually sets up a number of child processes, each 
one being responsible for sending the metric string to 
a particular server computer and receiving the re- 
sponse from the server computer. 

One th responses have b en r ceived th de- 
cisi n logic will collate the r sponses. d cide which 
server computer is most suitable, and then requ st 
th writing means 160 to pass th Int rnetaddr ssof 
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that server computer to the storage device for asso- 
ciation with the generic server computer name. If 
however th most suitable s rver computer Is th 
same server computer as that identified in the last 
iteration of the process then there is no need to up- 5 
date the storage device and the writing means will not 
be activated. 

Once any necessary update to the storage de- 
vice 80 has been made the messaging means 1 70 will 
notify the copier 90 so that the copier updates the lo- to 
cal memory 100 with the new list as stored in the stor- 
age device 80. 

Hence when the client computer requests a ma- 
chine address for the generic server computer name 
the conversion means 70 accesses the list in memory is 
1 00 and identifies a machine address just as it would 
if any other server computer name had been given. 
However in this instance the machine address actual- 
ly relates to the server computer in the cluster which, 
has been identified by the decision logic as the most 20 
suitable (eg. least heavily loaded). When this ma- 
chine address is passed back to the client computer 
20 via the output means 110, the client computer will 
proceed to automatically access the server which is 
most suitable. 25 

By this approach it will be seen that a dynamic 
load levelling facility is provided which is completely 
transparent to the client program. As far as the pro- 
gram is concerned it is requesting a machine address 
as normal and is using one of the normal TCP/IP ac- 30 
cess methods to gain access to the server computer 
allocated to It. 

In many of todays computing environments {eg 
Unix, AIX (Unix is a Trade Mark of Unix Systems Lab- 
oratories Inc)) an application is provided to perform 35 
the standard name resolution service (ie receipt of a 
computer name and conversion of that computer 
name into a full Internet address). This application is 
commonly known as a "nameserver" application, and 
is installed on one or more computers in the logical 40 
network. Every other computer in the network is told 
to communicate with a specified one of these *name- 
server' computers when it wishes to detanmine an In- 
ternet address for any other computer In the network. 
Hence a nameserver computer provides a resolution 45 
service to client computers by receiving from them a 
convenient name given to a particular computer (eg. 
abc.def.ghi.com), and converting it into a full Internet 
address (eg. 29.1.19.66). This Internet address is 
then used by the routing subsystem (TCP/IP) to allow so 
a client user or application access to the physical 
comput r ( g "abc" in this example). 

In the abov exampl of a c mputer name, "abc" 
Is the physical machin . "der is typically th site lo- 
cation, "ghi" the organisation, and "com" one of the I n- 55 
ternet classes (three such classes ar (com)mercial, 
(edu)cation, (mil)itary). Domains and sub-domains 
can also b added as part of this comput r name. Ba- 



sically the name takes a hierarchical form, with the 
finest resolution at the beginning and th coarsest re- 
solution at the nd; this typ of naming structure will 
of course b w II known to those skill d in the art. 

All TCP/tP-based applications, including remot - 
login, remote-shell, telnet, ftp, and also client-s rver 
applications (such as database applications), ar 
aware of the nameserver facility, and wilt automati- 
cally go to the designated nameserver comput r to 
ask for resolution of a computer name into an Int rnet 
address before attempting to make a connect! n to 
another computer in the network. 

If we consider Figure 1 again, the standard name- 
server facility will include the following elements: the 
input means 30, the converston means 70 with asso- 
ciated memory 100, the output means 110, th list 
stored in the storage device 80, and the copier 90. 

The nameserver application is a ^'daemon" (back- 
ground) process which runs on the data processing 
system; this data processing system may (but n d 
not) be one of the server computers forming part of 
the duster over which users are to be distributed. In 
Unix-type operating systems (eg. AIX by IBM Corpor- 
ation, Ultrix by Digital Equipment Corporation, OSF/1 
by the Open Software Foundation, and HP-UX by 
Hewlett Packard, etc) this daemon process is call d 
"named" (name-daemon), and when it is initiialis d, it 
reads a special database file (named.data) stored on 
the storage device 80 to obtain details of the comput- 
er names about which it is expected to know (over 
which it has "authority"), and the corresponding Inter- 
net addresses ("dotted decimal", e.g. 29.1.19.66) for 
each computer name. Whilst the name daemon is op- 
erating, it can be forced to re-read the information 
from the named.data database file by the sending of 
an inter-process signal to the name daemon process 
telling It to update its internal tables 100 from the da- 
tabase file (named.data). 

In the preferred embodiment of the present inv n- 
tion we provide a further facility which runs on the 
same computer as the nameserver application 
("named"), and interfaces with it. A 'generic* comput- 
er name is Introduced Into the database file 
(named.data), which refers not to one specific com- 
puter, but to any one of a number of computers offer- 
ing equivalent functionality. For example, the gen he 
name might be "server.cluster.def.ghi.com"; a cli nt 
program requesting a connection to 'server.duster' is 
requesting connection to any one of the comput rs In 
the server cluster. 

The further facility provided by the preferred m- 
bodim nt will be r ferr d to h r aft r as the "User 
Load L veller' (ULL) application. This application is 
r sp nslbleford ciding which s rver computer in th 
dust r is currently th I ast heavily load d, according 
to some appropriate m trie, and for conv ying this in- 
formation to the nam server application. Then sub- 
s qu nt requests for r solution of the g neric s rv r 
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computer name to an Internet address result in the 
nameserver application sending back to the client 
computer the Internet address of the server computer 
which has been deemed to be the most appropriate 
s rver computer for conn ction at that point in time. 5 

Th ULL application consists of the following ele- 
ments from figure 1 : the decision logic 120 with child 
processes 130. 140, 150; the writing means 160; and 
the messaging means 170. As described earlier with 
reference to Figure 1 the ULL application periodically io 
(at a frequency which can varied (eg. tuned by a sys- 
tem administrator or dynamically adjusted)) polls the 
server computers in the cluster to determine how 
"busy" in some sense they are. The metric used may 
vary, depending on the type of work which is being is 
handled by the cluster, but may for example include 
the number of login sessions, number of application 
Instances running, number of idle cpu cycles since the 
last poll, etc. The metric can be altered to ensure that 
it is appropriate to a specific situation. 20 

Based on the results of this polling, and taking 
into account the situation where a server computer in 
the cluster is too busy to respond to the status request 
within a certain number of seconds, the ULL applica- 
tion decides which machine is currently the least 25 
heavily loaded. The ULL application then modifies 
the database file (named.data) to associate the gen- 
eric cluster machine name with the Internet address 
of this least heavily loaded machine, and sends the 
special inter-process signal via the messaging means jo 
1 70 which tells the nameserver application to re-read 
its database file. The nameserver application will 
then, in response to a name resolution request from 
a client program, resolve the generic server computer 
name into the Internet address of the most appropri- 35 
ate server computer in the cluster for the client pro- 
gram to connect to. 

The process carried out by the decision logic 120 
of the preferred embodiment. will now be described in 
more detail with reference to Figure 2. At step 200 the 40 
ULL application is initialised. A number of steps are 
canried out at initialisation. For example the applica- 
tion: checks for multiple copies of the ULL application 
in memory; cleans up from a previous run of the ap- 
plication (by freeing up system resources such as 45 
memory, locks and semaphores still held in the name 
of the previous instance of the ULL application); and 
locates the nameserver application (named) and its 
data file (named.data). The ULL application then 
parses its configuration file to read information de- so 
fined by the system administrator, such as the metric 
strings, poll periods. Identities of server comput rs in 
th dust r. tc. Furth r the ULL application generates 
a number of "child** pr c sses - one p r server com- 
put r - which ar each r sponsibi for polling th ac- 55 
tivity of one designated serv r. 

One the initialisation has be n completed th 
process enters a main loop which x cut s until the 



ULL application is terminated. At steps 210. 220. 230 
and 240 the child processes send a metric string (as 
defined by the system administrator) to each server 
computer in the duster, await responses from those 
computers, and then wait for a trigg r signal from the 
main ULL application. 

Once the trigger signal has been sent by the main 
application the responses are sent by the child proc- 
esses to the main application. The main application 
then collates the activity results received from th 
child processes (step 250), and based on predeter- 
mined test criteria identifies the most appropriate ser- 
ver computer (the "least busy** server computer) at 
step 260. At step 270 it is determined whether the ser- 
ver computer identified at step 260 differs from the 
current nominated server computer. If it does then the 
process advances to step 290, at which point th 
nameserver*s data file (named.data) is modified. 
Further at step 300 a notification signal is sent to the 
nameserver application (named) to tell it to update its 
internal information from the data file. 

The process then proceeds to step 280. If at step 
270 it is determined that the server computer identi- 
fied at step 260 is the same as the current nominat d 
server computer then the process moves straight to 
step 280 without steps 290 and 300 being carried ouL 
Writing to and reading from the data file are time con- 
suming activities and so steps 290 and 300 should 
only be performed when necessary (ie when th 
"least busy" server computer changes). 

At step 280 the process waits until the end of the 
"poll period". This period is the predetermine^ interval 
(as defined by the system administrator) betw n 
successive studies of the server computers by the 
ULL application. Once the poll period has expired th 
process loops back to steps 210-240 and the main 
loop is repeated. 

Having discussed the preferred embodiment a 
few possible alterations will now be discussed. Firstly 
more than one generic server computer name can b 
added to the list in storage device 80 (the named.data 
file). Each generic name could be associated with a 
particular group of server computers, these groups 
being either completely separate or having a few s r* 
ver computers common to a plurality of the groups. 
Indeed one group may be a subset of another group. 
As an example consider Figure 3. Aserver duster 410 
comprises eight server computers 400. All eight ser- 
ver computers have access to a main body of data, 
but only four of them have access to some further 
(possibly more confidential) data. 

In this situation two generic names could be gen- 
rated, eg. "general.duster^ and "sp cificduster". 
Anyone of th ight computers (endosed by ring 430) 
can b associated with the former g n ric name, but 
only the f ur ndosed by ring 420 can be associat d 
with the latt r g neric name, since only those four 
have access t th further (confidential) data. 
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The system administrator can then set up the 
m trie string to be used wh n studying all eight serv r 
computers, or when studying only the four in ring 420; 
the metric string could be th same in b thinstanc s 
but need not be. If a client application needs access 5 
to the confidential information then it would request 
access to "specif icduster". but if an application only 
needed access to the general information then it 
would request access to "general.cluster". 

By this approach an application which only needs io 
access to the general information will always be con- 
nected to the least busy server computer whilst an 
application which needs access to the further (confi- 
dential) information will be given the machine address 
of the least busy server computer that can actually is 
provide the necessary service; this may or may not be 
the least busy server computer in the network. 

In preferred embodiments a further feature is 
provided to enable the decision logic to temporarily 
implement a "round-robin* metric instead of the 2o 
above described 'studying' process. The round-robin 
principle will be familiar to those skilled in the art; ba- 
sically when a client application requests access to a 
server computer it is assigned a particular server 
computer, and when the next request is received then 2S 
that application is assigned the next server computer 
in the cluster, and so on. In this way the server com- 
puters are rotated so that each successive server ac- 
cess is made on a different server computer to the 
previous server access. Alternatively the server com- 30 
puters can be rotated at fixed time intervals rather 
than after each access request. 

Although the round-robin technique does not 
have regard to the loading on any of the server com- 
puters, and so there is no determination of the least 35 
busy server computer, there are certain situations 
(eg. where there are lots of client applications which 
only take a short amount of database connection 
time) where a round-robin approach is acceptable. To 
implement the round robin approach the decision log- 40 
ic 120 would ask the writing means 160 to update the 
storage device 80 after each access request has 
been handled (or at fixed time intervals if the alterna- 
tive approach is used), so that the generic name is al- 
ways associated with successive server computers in 45 
the cluster in turn. 

From the above description it will be clear that the 
system of the preferred embodiment has a number of 
advantages. Firstly the technique dynamically allo- 
cates new client users and applications to the server so 
computer which is least heavily loaded at the time 
theymak the conn ction. thus nsuringan ven dis- 
tribution of users and applications across all of the 
available s rver computers. The client computer only 
briefly contacts th data processing system of the 55 
preferr d embodiment to res Iv the generic comput- 
er nam into a machine addr ss. Completely stan- 
dard access methods (eg. as provided by TCP/IP) are 



then used to make the connection, thus avoiding any 
proprietary protocols or any n ed to modify acc ss 
methods or applications, and so pr viding fully trans- 
parent user load levelling. 

Secondly the techniqu of the preferred embodi- 
ment does not involve any modification to the name- 
server code - the User Load Leveller application in- 
terfaces with the standard code (eg. "named" as ship- 
ped with the unix/AlX operating system). It would be 
possible to provide similar functionality to that descri- 
bed here by producing a modified version of the 
nameserver code. However, avoiding this brings ma- 
jor advantages from both marketing and maintenance 
points of view. 

Further the technique can be operated without re- 
quiring any modification to the server comput rs. 
They are accessed in a standard way after the gen- 
eric server computer name has been used to provide 
the client computer with a machine address. 

Another advantage is that the key parameters, 
such as the time interval between polls of the serv r 
computers in the cluster and the metric used to deter- 
mine which server computer is least heavily load d, 
can be altered and tuned by a local system adminis- 
trator, allowing the system to be optimised for a par- 
ticular situation. 

The above described ULL application could be 
supplied as a separate tool to enhance the useability 
of parallel and distributed systems, or could be ship- 
ped with the nameserver application. 



Claims 

1. A data processing system for facilitating a con- 
nection of a program on a client computer to a 
server, the server consisting of a plurality of s r- 
ver computers with shared resources, the data 
processing system, the client computer, and the 
server computers residing in a network, the sys- 
tem comprising: 

input means for receiving a request from the cli- 
ent computer for a machine address of a serv r 
computer identified by a server computer name 
sent with the request such a machine address 
enabling a connection to be made from the client 
computer to that server computer via the net- 
work; 

a storage device for storing a list identifying ser- 
ver computer names with machine addresses of 
the server computers; 

c nversi n means for using th listtoconv rtth 
s rver computer name received by the input 
means into the machine address of th serv r 
computer; 

output means for sending th machin address 
from the conversi n means to th client comput- 
n 
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the system being characterised by: 
decision logic for studying the server computers 
at predetermined intervals having regard to pre- 
determined t St criteria, in order to select one of 
the server computers; and 5 
writing m ans for updating th list by associating 
the machine address for the server computer se- 
lected by the decision logic with a particular ser- 
ver computer name contained as a generic server 
computer name in the list; to 
whereby when a client computer specifies the 
generic server computer name, it receives the 
machine address of the server computer identi- 
fied by the decision logic. 

IS 

2. A system as claimed in Claim 1 further compris- 
ing: 

a copier to copy the list from the storage 
device to a piece of memory accessible by the 
conversion means; 20 

a messaging means, responsive to the up- 
dating of the list by the writing means, for sending 
a message to the copier requesting the copier to 
copy the updated list into the piece of local mem- 
ory. 25 

3. A system as claimed in Claim 1 or Claim 2. where- 
in the predetermined test criteria are such that the 
decision logic identifies the server computer hav- 
ing the least number of client programs logged on 30 

to it. 

4. A system as claimed in any of claims 1 to 3. 
wherein the predetermined intervals are variable. 
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5. A system as claimed in any preceding claim, 
wherein the predetermined test criteria are set by 
a user of the system. 

6. A system as claimed in any preceding claim, 
wherein the user can limit the number of server 
computers which the decision logic studies. 



A system as claimed in any preceding claim 
wherein a plurality of generic names are used, 
each one having a number of server computers 
whose machine addresses are associated with 
that generic name, the decision logic employing 
different sets of predetermined test criteria for 
each generic name. so 
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plurality of s rver computers with shar d re- 
sources, the data processing syst m. th cli nt 
computer, and the server comput rs residing in a 
network, the method comprising th steps of: 

(a) receiving a request from the client comput- 
er for a machine address of a server computer 
id ntified by a server computer name sent 
with the request, such a machine address 
enabling a connection to be made from the cli- 
ent computer to that server computer via th 
network; 

(b) storing a list identifying server computer 
names with machine addresses of the server 
computers in a storage device; 

(c) converting, with reference to the list, the 
server computer name received at step (a) 
Into the machine address of the server conv 
puter; 

(d) sending the machine address identif i d at 
step (c) to the client computer; 

the method being characterised by the steps 
of: 

(e) employing decision logic to study the ser- 
ver computers at predetermined intervals 
having regard to predetermined test criteria, in 
order to select one of the server computers; 
and 

(f) updating the list by associating the ma- 
chine address for the server computer select- 
ed by the decision logic with a particular ser- 
ver computer name contained as a gen ric 
server computer name in the list; , 

whereby when a client computer specifies the 
generic server computer name at step (a), it r - 
ceives the machine address of the server com- 
puter identified by the decision logic. 

10. A method as claimed in Claim 9 further compris- 
ing the steps of: 

copying the list from the storage device to 
a piece of memory accessible at the conversion 
step (c); 

repeating, in response to the updating of 
the list at step (f). the copying step to ensure that 
the updated list is copied into the piece of local 
memory. 

11. A method as as claimed in claim 9 or claim 10. 
wherein the predetermined intervals are set by a 
user of the system. 



8. A syst m as claimed in Claim 7. wher in one or 
mor oftheserv r computers are associated with 
a plurality of the generic nam s. 

9. Am thod of operating a data processing system 
to facilitate a c nn ction of a program n a client 
computer to a server, th s rv r consisting of a 
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12. A method as claimed in any of claims 9 to 11. 
wherein the pred termin d test criteria are s tby 
a user of th system. 

13. A method as claimed in any of claims 9 to 12. 
wherein the user can limit th numb r of s rv r 
comput rs which the decision logic studi s. 
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14. A method as claimed in any of claims 9 to 13. 
wh rein a plurality of generic names are used, 
each one having a number of server computers 
whos machin addresses are associat d with 
that generic name, the decision I gic employing $ 
different sets of predetermined test criteria for 
each generic name. 

1 5. A method as claimed in Claim 14. wherein one or 
more of the server computers are associated with to 
a plurality of the generic names. 
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